Search CORE

60 research outputs found

Conic Multi-Task Classification

Author: A. Argyriou
A. Maurer
A. Maurer
A. Maurer
A. Rakotomamonjy
G. Obozinski
M. Kloft
P. Gong
R. Caruana
S.M. Kakade
W. Samek
Publication venue
Publication date: 01/01/2014
Field of study

Traditionally, Multi-task Learning (MTL) models optimize the average of task-related objective functions, which is an intuitive approach and which we will be referring to as Average MTL. However, a more general framework, referred to as Conic MTL, can be formulated by considering conic combinations of the objective functions instead; in this framework, Average MTL arises as a special case, when all combination coefficients equal 1. Although the advantage of Conic MTL over Average MTL has been shown experimentally in previous works, no theoretical justification has been provided to date. In this paper, we derive a generalization bound for the Conic MTL method, and demonstrate that the tightest bound is not necessarily achieved, when all combination coefficients equal 1; hence, Average MTL may not always be the optimal choice, and it is important to consider Conic MTL. As a byproduct of the generalization bound, it also theoretically explains the good experimental results of previous relevant works. Finally, we propose a new Conic MTL model, whose conic combination coefficients minimize the generalization bound, instead of choosing them heuristically as has been done in previous methods. The rationale and advantage of our model is demonstrated and verified via a series of experiments by comparing with several other methods.Comment: Accepted by European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECMLPKDD)-201

arXiv.org e-Print Archive

Crossref

University of Central Florida (UCF): STARS (Showcase of Text, Archives, Research & Scholarship)

Improved extended-range prediction of persistent stratospheric perturbations using machine learning

Author: D. I. V. Domeisen
D. I. V. Domeisen
E. Székely
G. Obozinski
R. de Fondeville
Z. Wu
Publication venue: 'Copernicus GmbH'
Publication date: 01/04/2023
Field of study

On average every 2 years, the stratospheric polar vortex exhibits extreme perturbations known as sudden stratospheric warmings (SSWs). The impact of these events is not limited to the stratosphere: but they can also influence the weather at the surface of the Earth for up to 3 months after their occurrence. This downward effect is observed in particular for SSW events with extended recovery timescales. This long-lasting stratospheric impact on surface weather can be leveraged to significantly improve the performance of weather forecasts on timescales of weeks to months. In this paper, we present a fully data-driven procedure to improve the performance of long-range forecasts of the stratosphere around SSW events with an extended recovery. We first use unsupervised machine learning algorithms to capture the spatio-temporal dynamics of SSWs and to create a continuous scale index measuring both the frequency and the strength of persistent stratospheric perturbations. We then uncover three-dimensional spatial patterns maximizing the correlation with positive index values, allowing us to assess when and where statistically significant early signals of SSW occurrence can be found. Finally, we propose two machine learning (ML) forecasting models as competitors for the state-of-the-art sub-seasonal European Centre for Medium-Range Weather Forecasts (ECMWF) numerical prediction model S2S (sub-seasonal to seasonal): while the numerical model performs better for lead times of up to 25 d, the ML models offer better predictive performance for greater lead times. We leverage our best-performing ML forecasting model to successfully post-process numerical ensemble forecasts and increase their performance by up to 20 %.</p

Directory of Open Access Journals

Multi-Target Prediction: A Unifying View on Problems and Methods

Multi-target prediction (MTP) is concerned with the simultaneous prediction of multiple target variables of diverse type. Due to its enormous application potential, it has developed into an active and rapidly expanding research field that combines several subfields of machine learning, including multivariate regression, multi-label classification, multi-task learning, dyadic prediction, zero-shot learning, network inference, and matrix completion. In this paper, we present a unifying view on MTP problems and methods. First, we formally discuss commonalities and differences between existing MTP problems. To this end, we introduce a general framework that covers the above subfields as special cases. As a second contribution, we provide a structured overview of MTP methods. This is accomplished by identifying a number of key properties, which distinguish such methods and determine their suitability for different types of problems. Finally, we also discuss a few challenges for future research

arXiv.org e-Print Archive

Crossref

Ghent University Academic Bibliography

Low Complexity Regularization of Linear Inverse Problems

Author: A Girard
A. Barron
A. Beck
A. Beck
A. Chambolle
A. Chambolle
A. Daniilidis
A. Montanari
A.N. Tikhonov
A.N. Tikhonov
A.S. Bandeira
A.S. Lewis
A.S. Lewis
A.S. Lewis
A.S. Lewis
B. Efron
B. Efron
B. Recht
B.A. Turlach
B.C. Vũ
B.D. Rao
B.F. Svaiter
B.K. Natarajan
B.S. Mordukhovich
C. Chaux
C. Deledalle
C. Dossal
C. Dossal
C. Lemaréchal
C. Vonesch
C.-A. Deledalle
C.-A. Deledalle
C.L. Mallows
C.M. Stein
D. Gabay
D. Gabay
D. Gross
D. Needell
D. Ville Van De
D. Ville Van De
D.A. Lorenz
D.A. Spielman
D.L. Donoho
D.L. Donoho
D.L. Donoho
D.L. Donoho
E. Grave
E. Hale
E. Harchaoui
E. J. Candès
E. Richard
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
E.J. Candès
F. Bach
F. Bach
F. Luisier
F. Santosa
G. Chen
G. Davis
G. Obozinski
G. Peyré
G. Steidl
G.B. Passty
G.H. Golub
H. Akaike
H. Jégou
H. Jégou
H. Raguet
H. Zou
H.H. Bauschke
H.H. Bauschke
H.L. Taylor
H.M. Hudson
I. Daubechies
J. Allen
J. Bolte
J. Chen
J. Douglas
J. Eckstein
J. Eckstein
J. Eckstein
J. Mairal
J. Tropp
J. Ye
J.-B. Hiriart-Urruty
J.-C. Pesquet
J.-F. Aujol
J.-J. Fuchs
J.-J. Fuchs
J.-L. Starck
J.-L. Starck
J.A. Tropp
J.C. Dunn
J.E. Vogt
J.F. Claerbout
J.M. Lee
K. Bredies
K. Bredies
K. Bredies
K. Kato
K. Knight
K.-C. Li
L. Birgé
L. Birgé
L. Borup
L. Condat
L.I. Rudin
L.M. Briceño Arias
M Coste
M. Elad
M. Elad
M. Fazel
M. Fortin
M. Frank
M. Golbabaee
M. Grasmair
M. Jaggi
M. Meyer
M. Rudelson
M. Yuan
M.J. Wainwright
M.R. Osborne
M.V. Solodov
N. Parikh
O. Scherzer
P. Hall
P. Hall
P. Tseng
P. Zhao
P. Zhao
P.L. Combettes
P.L. Combettes
P.L. Combettes
P.L. Lions
R. Ciak
R. Giryes
R. Glowinski
R. Gribonval
R. Gribonval
R. Gribonval
R. Gribonval
R. Jenatton
R. Jenatton
R. Refregier
R. Tibshirani
R. Tibshirani
R.J. Tibshirani
R.J. Tibshirani
R.L. Dykstra
R.T. Rockafellar
S. Nam
S. Negahban
S. Ramani
S. Ramani
S. Shalev-Shwartz
S. Vaiter
S. Vaiter
S.F. Cotter
S.G. Lingala
S.G. Mallat
S.G. Mallat
S.G. Mallat
S.J. Wright
S.N. Negahban
S.P. Boyd
S.S. Chen
T. Blu
T. Blumensath
T. Strohmer
T.T. Cai
T.T. Cai
T.T. Cai
V. Chandrasekaran
V. Duval
V. Solo
W.L. Hare
X. Shen
Y. Castro de
Y. Censor
Y. Chen
Y. Lyubarskii
Y. Nesterov
Y. Nesterov
Y.C. Eldar
Y.C. Pati
Publication venue
Publication date: 01/01/2014
Field of study

Inverse problems and regularization theory is a central theme in contemporary signal processing, where the goal is to reconstruct an unknown signal from partial indirect, and possibly noisy, measurements of it. A now standard method for recovering the unknown signal is to solve a convex optimization problem that enforces some prior knowledge about its structure. This has proved efficient in many problems routinely encountered in imaging sciences, statistics and machine learning. This chapter delivers a review of recent advances in the field where the regularization prior promotes solutions conforming to some notion of simplicity/low-complexity. These priors encompass as popular examples sparsity and group sparsity (to capture the compressibility of natural signals and images), total variation and analysis sparsity (to promote piecewise regularity), and low-rank (as natural extension of sparsity to matrix-valued data). Our aim is to provide a unified treatment of all these regularizations under a single umbrella, namely the theory of partial smoothness. This framework is very general and accommodates all low-complexity regularizers just mentioned, as well as many others. Partial smoothness turns out to be the canonical way to encode low-dimensional models that can be linear spaces or more general smooth manifolds. This review is intended to serve as a one stop shop toward the understanding of the theoretical properties of the so-regularized solutions. It covers a large spectrum including: (i) recovery guarantees and stability to noise, both in terms of

\ell^2

-stability and model (manifold) identification; (ii) sensitivity analysis to perturbations of the parameters involved (in particular the observations), with applications to unbiased risk estimation ; (iii) convergence properties of the forward-backward proximal splitting scheme, that is particularly well suited to solve the corresponding large-scale regularized optimization problem

arXiv.org e-Print Archive

HAL - Normandie Université

CiteSeerX

Base de publications de l'université Paris-Dauphine

Crossref

Predicting Future Clinical Changes of MCI Patients Using Longitudinal and Multimodal Biomarkers

Author: A Argyriou
A Convit
A Convit
AM Fjell
AT Du
B Ron
C Davatzikos
C Davatzikos
C Hinrichs
C Misra
CC Chang
CM Stonnington
CR Jack
D Shen
D Shen
D Shen
D Zhang
D Zhang
Daoqiang Zhang
Dinggang Shen
DW Shattuck
FH Bouwman
G Chetelat
G Obozinski
G Wu
H Wang
J Golomb
J Liu
JC Morris
JG Sled
Kewei Chen
KK Leung
LK McEvoy
LM Shaw
MJ de Leon
N Fox
N Kabani
N Mattsson
R Cuingnet
R Tibshirani
S De Santi
S Duchesne
SM Smith
Y Cho
Y Fan
Y Fan
Y Fan
Y Li
Y Wang
Y Zhang
Z Xue
Z Xue
Publication venue: Public Library of Science
Publication date: 01/01/2012
Field of study

Accurate prediction of clinical changes of mild cognitive impairment (MCI) patients, including both qualitative change (i.e., conversion to Alzheimer's disease (AD)) and quantitative change (i.e., cognitive scores) at future time points, is important for early diagnosis of AD and for monitoring the disease progression. In this paper, we propose to predict future clinical changes of MCI patients by using both baseline and longitudinal multimodality data. To do this, we first develop a longitudinal feature selection method to jointly select brain regions across multiple time points for each modality. Specifically, for each time point, we train a sparse linear regression model by using the imaging data and the corresponding clinical scores, with an extra ‘group regularization’ to group the weights corresponding to the same brain region across multiple time points together and to allow for selection of brain regions based on the strength of multiple time points jointly. Then, to further reflect the longitudinal changes on the selected brain regions, we extract a set of longitudinal features from the original baseline and longitudinal data. Finally, we combine all features on the selected brain regions, from different modalities, for prediction by using our previously proposed multi-kernel SVM. We validate our method on 88 ADNI MCI subjects, with both MRI and FDG-PET data and the corresponding clinical scores (i.e., MMSE and ADAS-Cog) at 5 different time points. We first predict the clinical scores (MMSE and ADAS-Cog) at 24-month by using the multimodality data at previous time points, and then predict the conversion of MCI to AD by using the multimodality data at time points which are at least 6-month ahead of the conversion. The results on both sets of experiments show that our proposed method can achieve better performance in predicting future clinical changes of MCI patients than the conventional methods

Public Library of Science (PLOS)

Crossref

PubMed Central

Carolina Digital Repository

Predicting gene function using hierarchical multi-label decision tree ensembles

Author: A Clare
A Clare
A Clare
B Hayete
C Vens
Celine Vens
D Kocev
Dragi Kocev
E Zdobnov
F Provost
F Wilcoxon
G Obozinski
GR Lanckriet
H Blockeel
H Blockeel
H Blockeel
H Chua
H Drucker
H Lee
H Mewes
Hendrik Blockeel
J Davis
J Gough
J Quinlan
J Rousu
J Struyf
Jan Struyf
L Breiman
L Breiman
L Breiman
L Breiman
L Pena-Castillo
Leander Schietgat
M Ashburner
M Deng
M Ouali
N Cesa-Bianchi
O Troyanskaya
R Caruana
S Altschul
S Mostafavi
Sašo Džeroski
T Hughes
T Joachims
U Karaoz
W Kim
W Tian
Y Chen
Y Guan
Z Barutcuoglu
Publication venue: BioMed Central
Publication date: 01/01/2010
Field of study

Abstract Background <it>S. cerevisiae</it>, <it>A. thaliana </it>and <it>M. musculus </it>are well-studied organisms in biology and the sequencing of their genomes was completed many years ago. It is still a challenge, however, to develop methods that assign biological functions to the ORFs in these genomes automatically. Different machine learning methods have been proposed to this end, but it remains unclear which method is to be preferred in terms of predictive performance, efficiency and usability. Results We study the use of decision tree based models for predicting the multiple functions of ORFs. First, we describe an algorithm for learning hierarchical multi-label decision trees. These can simultaneously predict all the functions of an ORF, while respecting a given hierarchy of gene functions (such as FunCat or GO). We present new results obtained with this algorithm, showing that the trees found by it exhibit clearly better predictive performance than the trees found by previously described methods. Nevertheless, the predictive performance of individual trees is lower than that of some recently proposed statistical learning methods. We show that ensembles of such trees are more accurate than single trees and are competitive with state-of-the-art statistical learning and functional linkage methods. Moreover, the ensemble method is computationally efficient and easy to use. Conclusions Our results suggest that decision tree based methods are a state-of-the-art, efficient and easy-to-use approach to ORF function prediction.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

Leiden University Scholary Publications

A critical assessment of Mus musculus gene function prediction using integrated genomic evidence

Background: Several years after sequencing the human genome and the mouse genome, much remains to be discovered about the functions of most human and mouse genes. Computational prediction of gene function promises to help focus limited experimental resources on the most likely hypotheses. Several algorithms using diverse genomic data have been applied to this task in model organisms; however, the performance of such approaches in mammals has not yet been evaluated. Results: In this study, a standardized collection of mouse functional genomic data was assembled; nine bioinformatics teams used this data set to independently train classifiers and generate predictions of function, as defined by Gene Ontology (GO) terms, for 21,603 mouse genes; and the best performing submissions were combined in a single set of predictions. We identified strengths and weaknesses of current functional genomic data sets and compared the performance of function prediction algorithms. This analysis inferred functions for 76% of mouse genes, including 5,000 currently uncharacterized genes. At a recall rate of 20%, a unified set of predictions averaged 41% precision, with 26% of GO terms achieving a precision better than 90%. Conclusion: We performed a systematic evaluation of diverse, independently developed computational approaches for predicting gene function from heterogeneous data sources in mammals. The results show that currently available data for mammals allows predictions with both breadth and accuracy. Importantly, many highly novel predictions emerge for the 38% of mouse genes that remain uncharacterized

University of Toronto Research Repository

Crossref

Springer - Publisher Connector

PubMed Central

eScholarship - University of California

Warwick Research Archives Portal Repository

PORTO@iris (Publications Open Repository TOrino - Politecnico di Torino)

PORTO Publications Open Repository TOrino

A Genome-Wide Gene Function Prediction Resource for Drosophila melanogaster

Author: A Liaw
A Statnikov
A Vazquez
AC Edwards
AC Gavin
AJ Walhout
AM Johansson
B Estrada
C Stark
CJ Echeverri
CL Myers
David E. Hill
DB Johnson
EM Marcotte
Frederick P. Roth
G Obozinski
GE Carney
GW Muse
H Agaisse
H Yu
Han Yan
HJ Lee
HL Liang
HN Chua
I Carrera
I Flockhart
IH Witten
J Beaver
J Jemc
J Reboul
J Wang
J Yu
JC Costello
JF Rual
JG Mezey
JG Sorensen
John E. Beaver
JZ Maines
K Venkatesan
KA Boltz
Kavitha Venkatesan
KC Gunsalus
KD Pruitt
KE Weber
L Breiman
L Giot
LC Firth
M Ashburner
M Johnson
M Kanehisa
M Tasan
M Umemori
Marc Vidal
ME Cusick
Michael E. Cusick
ML Whitfield
MN Arbeitman
Muhammed A. Yildirim
N Robine
N Simonis
NA Terry
Nicholas James Provart
Niels Klitgord
NJ Mulder
Norbert Perrimon
P Braun
P Mourikis
P Muller
P Tomancak
P Uetz
R Sharan
RB Beckstead
RJ Wilson
RL Tatusov
S Aerts
S Li
SB Kotsiantis
T Brody
T Ito
Tong Hao
V Reinke
W Tian
X Deng
X Deng
X Deng
X Qin
X Wang
X Wu
Y Ho
Publication venue: Public Library of Science
Publication date: 01/08/2010
Field of study

Predicting gene functions by integrating large-scale biological data remains a challenge for systems biology. Here we present a resource for Drosophila melanogaster gene function predictions. We trained function-specific classifiers to optimize the influence of different biological datasets for each functional category. Our model predicted GO terms and KEGG pathway memberships for Drosophila melanogaster genes with high accuracy, as affirmed by cross-validation, supporting literature evidence, and large-scale RNAi screens. The resulting resource of prioritized associations between Drosophila genes and their potential functions offers a guide for experimental investigations

Public Library of Science (PLOS)

CiteSeerX

Crossref

Directory of Open Access Journals

PubMed Central

Learning Sparse FRAME Models for Natural Image Patterns

Author: A Adler
A Gelman
AM Bruckstein
AP Dempster
BA Olshausen
DH Ackley
G Obozinski
GE Hinton
GE Hinton
J Tropp
JH Friedman
Jianwen Xie
L Younes
M Aharon
M Elad
M Elad
M Riesenhuber
P Felzenszwalb
R Neal
R Rubinstein
R Tibshirani
RE Fan
S Duane
S Geman
S Geman
S Mallat
S Nama
SC Zhu
SC Zhu
Song-Chun Zhu
SS Chen
Wenze Hu
Y Hong
Ying Nian Wu
Z Si
Publication venue: 'Springer Science and Business Media LLC'
Publication date
Field of study

Crossref